Abstract

We study the ratio ergodic theorem (RET) of Hopf for group actions. Under a certain technical condition, if a sequence of sets $\{F_n\}$ in a group satisfies the RET, then there is a finite set $E$ such that $\{EF_n\}$ satisfies the Besicovitch covering property. Consequently for the abelian group $G=\bigoplus_{n=1}^{\infty}\mathbb{Z}$ there is no sequence $F_n\subseteq G$ along which the RET holds, and in many finitely generated groups, including the discrete Heisenberg group and the free group on $\ge 2$ generators, there is no (sub)sequence of balls, in the standard generators, along which the RET holds.

On the other hand, in groups with polynomial growth (including the Heisenberg group, to which our negative results apply) there always exists a sequence of balls along which the RET holds if convergence is understood as a.e. convergence in density (i.e. omitting a sequence of density zero).

1 Introduction 

Let $G$ be a countable group acting from the left by measure-preserving transformations on a measure space $(X,\mathcal{B},\mu)$, with the action of $g\in G$ on $x\in X$ written $x\mapsto T^g x$. We assume the action is ergodic. For a finite set $F\subseteq G$ and $\varphi:X\to\mathbb{R}$ let

$$S_F(\varphi)(x)=\sum_{g\in F}\varphi(T^g x).$$

The asymptotic behavior of $S_{F_n}(\varphi)$ as $F_n$ exhausts the group, in some sense, is the subject of the ergodic theorem. In this paper we are interested in the situation when $\mu$ is an infinite (without loss of generality $\sigma$-finite) measure, in which case the appropriate quantities to consider are the ratios

$$R_n(\varphi,\psi)=R_{F_n}(\varphi,\psi)=\frac{S_{F_n}(\varphi)}{S_{F_n}(\psi)}.$$

One says that $G$ satisfies a ratio ergodic theorem along $\{F_n\}$ if for every ergodic measure-preserving action of $G$ on a non-atomic measure space, and every $\varphi,\psi\in L^1(\mu)$ with $\int\psi\,d\mu\neq 0$, we have

$$R_n(\varphi,\psi)\to\frac{\int\varphi\,d\mu}{\int\psi\,d\mu}\qquad\mu\text{-a.e.}$$

For $G=\mathbb{Z}$ and $F_n=[1,n]\cap\mathbb{Z}$ the ratio ergodic theorem was proved by Hopf in 1937, only a few years after the ergodic theorems of von Neumann and Birkhoff. Unlike the latter theorems, however, which have been extended to very general classes of groups (see [10] for a recent survey), extensions of Hopf's theorem have been slow to appear. Part of the reason is that, for a time, it was believed that no extension is possible, due to an example of Brunel and Krengel [8], who showed for $\mathbb{Z}^d$, $d\ge 2$, that the ratio ergodic theorem fails along $F_n=[0,n]^d$. Nevertheless there is a ratio ergodic theorem for actions of $\mathbb{Z}^d$ along symmetric cubes $F_n=[-n,n]^d\cap\mathbb{Z}^d$. This was first proved by Feldman under additional assumptions on the dynamics of the action [3], and we proved the general case in [7] (also for some more general sequences $F_n$).

The prospects for groups other than $\mathbb{Z}^d$ have remained unclear. A pertinent fact
from [7] is that a certain maximal inequality, which is central to most existing proofs, 
is actually equivalent, in the present context, to {F n } satisfying the (right) Besicovitch 
covering property (Definition 2.1 below). Sufficiency of this property was observed
earlier by Becker [2] (see also [HI [9]). The Besicovitch property is quite rare, and its 
failure puts into question the validity of the ratio ergodic theorem for many groups. 
But, on the other hand, as far as we know the maximal inequality and the ratio ergodic 
theorem are not equivalent. Indeed for $\mathbb{Z}^d$ it took several decades to prove the latter
once the former became available. 

In this paper we present two main results. The first shows that, indeed, the ra- 
tio ergodic theorem is quite rare, and is closely linked to the Besicovitch property. 
The second, on the other hand, shows that a certain weakening of it does hold more 
generally, including in groups where the strong version above fails. 

We begin with our main negative result, which requires the following definition. Let us say that $\{F_n\}$ is almost central if for every $g\in G$ there is a finite set $E$ with $F_n g\subseteq EF_n$ for all $n$ (here and throughout we write $AB=\{ab : a\in A,\ b\in B\}$, etc.). This holds trivially in abelian groups, and also for balls in any finitely generated group.

Theorem 1.1. If $\{F_n\}$ is almost central and satisfies the ratio ergodic theorem, then there is a finite set $E$ such that $\{EF_n\}$ is Besicovitch.



1 Bowen and Nevo have also recently obtained a variant of the ratio ergodic theorem for free groups, 
but with some additional randomization which makes the problem somewhat different. 
For a marginally stronger statement see Theorem 2.7. It seems possible that the
Besicovitch property is necessary in general for a ratio ergodic theorem, but this re- 
mains open. 

We give two main applications. First, 

Theorem 1.2. Let $\mathbb{Z}^\infty=\bigoplus_{n=1}^{\infty}\mathbb{Z}$. Then the ratio ergodic theorem fails along every sequence $F_n\subseteq\mathbb{Z}^\infty$.

The group $\mathbb{Z}^\infty$ is extremely "nice": it is abelian, amenable and residually finite. As such it is quite surprising that the ratio ergodic theorem should fail. However one might object that it is "infinite-dimensional". So suppose now that $G$ is generated by a finite symmetric set $A$ and let $B_n=A^n$ be the associated "balls".

Theorem 1.3. Let $G$ be finitely generated and $\{B_n\}$ the sequence of balls with respect to some finite generating set. Suppose that no sequence of balls has the Besicovitch property. Then the ratio ergodic theorem fails along every sub-sequence $F_i=B_{n(i)}$.

In particular, in the discrete Heisenberg group with the standard generator set there is no sequence of balls which satisfies the ratio ergodic theorem (see Section 2.3). The same holds for free groups on $\ge 2$ generators. In these examples we cannot yet
rule out the existence of some other sequence along which it does hold, and one should 
note that there are finitely generated groups, such as the lamplighter groups, for which 
balls are not the right averaging sets to consider. But for groups of polynomial growth 
all known ergodic theorems do hold along balls, and it would be quite surprising if 
some other good sequence exists. 

We turn now to our positive result which shows that, if one accepts a slightly weaker notion of convergence, then there is a version of the ratio ergodic theorem which holds in greater generality. Recall that a group $G$ has polynomial growth if the balls $B_n$ satisfy $|B_n|\le c'n^c$ for constants $c,c'$. Define the (upper) density of a set $I\subseteq\mathbb{N}$ of integers by

$$d(I)=\limsup_{N\to\infty}\frac{1}{N}\,|I\cap[1,N]|.$$

A sequence $a_n$ converges to $a$ in density if

$$d(n : |a-a_n|>\varepsilon)=0\quad\text{for every }\varepsilon>0.$$

We denote this limit by $a_n\xrightarrow{d}a$ or $d\text{-}\lim a_n=a$. This operator satisfies all the usual properties of limits.
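As a small numerical illustration of convergence in density (a sketch only; the function names and the test sequence are chosen here for illustration and are not taken from the paper), consider the sequence that equals 1 on perfect squares and 0 elsewhere. It does not converge in the usual sense, yet the proportion of exceptional indices up to $N$ tends to zero, so it converges to 0 in density.

```python
import math

def density_of_exceptions(seq, limit, eps, N):
    """Fraction of n in [1, N] with |seq(n) - limit| > eps."""
    bad = sum(1 for n in range(1, N + 1) if abs(seq(n) - limit) > eps)
    return bad / N

def spikes_on_squares(n):
    """Illustrative sequence: a_n = 1 when n is a perfect square, else 0."""
    r = math.isqrt(n)
    return 1.0 if r * r == n else 0.0

# The exceptional set {n : |a_n - 0| > 1/2} is the set of squares,
# whose proportion in [1, N] is about 1/sqrt(N).
for N in (10**2, 10**4):
    print(N, density_of_exceptions(spikes_on_squares, 0.0, 0.5, N))
```

The finite-$N$ proportions only approximate the $\limsup$ in the definition of $d$; they indicate the trend rather than prove it.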

Theorem 1.4. Let $G$ be a group of polynomial growth and $B_n$ as above. Then there is a subsequence $F_i=B_{n(i)}$ along which the ratio ergodic theorem holds in density, i.e.

$$R_{F_i}(\varphi,\psi)\xrightarrow{d}\frac{\int\varphi\,d\mu}{\int\psi\,d\mu}\qquad\mu\text{-a.e.}$$

for any ergodic measure-preserving action of $G$ and any $\varphi,\psi\in L^1(\mu)$ with $\int\psi\,d\mu\neq 0$.

The proof is given in Section 3. Thus there are cases, such as the discrete Heisenberg group, where no sequence of balls satisfies the ratio ergodic theorem, but there exist sequences along which the density version is valid. Our arguments are special to groups of polynomial growth but some parts carry over also to groups of sub-exponential growth (groups with $|B_n|=o(c^n)$ for all $c>1$). We do not know if and when similar modifications of the ratio ergodic theorem hold in more general groups.

There are other possible weakenings of the ratio ergodic theorem. One possibility is 
to require a.e. pointwise convergence of the ratios to other limit functions. This 
phenomenon has been recently observed in certain algebraic settings involving "large" 
groups acting on infinite measure spaces, see e.g. the introduction of [3jj§ However, 
our negative results exclude this as well; in the proofs we construct actions for which 
the ratios diverge. 

The rest of the paper is divided into two sections: In Section 2 we develop the necessary combinatorics and prove Theorems 1.3, 1.2 and 1.1. We prove Theorem 1.4 in Section 3.



2 Besicovitch is necessary 
2.1 Combinatorial preliminaries 

In this section $\{F_n\}$ denotes a sequence of finite subsets of $G$, all containing the identity element $1_G$. We begin with some combinatorial definitions.

A collection $\{E_i\}$ of subsets of $G$ is said to have multiplicity $k$ at a point $g$ if $g$ belongs to $k$ of the sets. The multiplicity of $\{E_i\}$ is the smallest $k$ such that all points have multiplicity $\le k$. The following definition is classical in analysis where, instead of translates of sets in a group, one considers balls in a metric space.

Definition 2.1. $\{F_n\}$ satisfies the Besicovitch covering property (or, more concisely, $\{F_n\}$ is Besicovitch) if there is a constant $C$ such that, for every finite $A\subseteq G$ and any family of sets of the form $\{F_{n(g)}g\}_{g\in A}$ there is a subset $A'\subseteq A$ such that $\{F_{n(g)}g\}_{g\in A'}$ covers $A$ and has multiplicity $\le C$; equivalently,

$$1_A\le\sum_{g\in A'}1_{F_{n(g)}g}\le C.$$

2 I am grateful to Amos Nevo for drawing my attention to this phenomenon.
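The two quantities in the Besicovitch property, covering and multiplicity, are easy to compute for finite families. The following is a minimal sketch for $G=\mathbb{Z}$ written additively, with $F_n=\{-n,\ldots,n\}$; the helper names `translate`, `ball`, `multiplicity` and `covers` are hypothetical and chosen only for this illustration.

```python
from collections import Counter

def translate(F, g):
    """Return the translate F + g of a finite set F of integers."""
    return {f + g for f in F}

def ball(n):
    """The symmetric interval F_n = {-n, ..., n} in Z."""
    return set(range(-n, n + 1))

def multiplicity(family):
    """Largest number of sets of the family that share a common point."""
    counts = Counter(p for S in family for p in S)
    return max(counts.values(), default=0)

def covers(family, A):
    """True if the union of the family contains A."""
    return set(A) <= set().union(*family)

A = range(6)
full = [translate(ball(1), g) for g in A]       # n(g) = 1 for every g in A
sub = [translate(ball(1), g) for g in (1, 4)]   # the subfamily A' = {1, 4}
print(multiplicity(full), covers(full, A))
print(multiplicity(sub), covers(sub, A))
```

Here the full family has overlapping members, while the subfamily indexed by $A'=\{1,4\}$ still covers $A$ with no overlaps, which is the kind of selection the definition asks for.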

It is easy to see that any finite sequence $\{F_n\}_{n=1}^{N}$ is Besicovitch and that $\{F_n\}_{n=1}^{\infty}$ is Besicovitch if and only if $\{F_n\}_{n=n_0}^{\infty}$ is Besicovitch for every $n_0$. In this section we rely primarily on the following characterization of the Besicovitch property. Define a (right) incremental sequence to be a finite sequence $(F_{n(i)}g_i)_{i=1}^{k}$ such that $g_j\notin\bigcup_{i<j}F_{n(i)}g_i$ and $n(1)\ge n(2)\ge\ldots\ge n(k)$.

Proposition 2.2. If $\{F_n\}$ is not Besicovitch then for every $k$ there is an incremental sequence of multiplicity $k$ (equivalently, with $1_G$ belonging to $k$ members of the sequence). If in addition $F_n$ are symmetric and increasing, the converse holds.

Proof. We include the standard proof for completeness. If $\{F_n\}$ is not Besicovitch then, given $k$, there is a family $\{F_{n(i)}g_i\}_{i=1}^{N}$ such that any sub-collection covering all the $g_i$ is of multiplicity at least $k$. We may assume $n(i)$ are non-increasing. Choose an incremental subsequence $\{F_{n(i_m)}g_{i_m}\}$ greedily: let $i_1=1$, and if $i_1,\ldots,i_m$ are defined take $i_{m+1}$ to be the minimal $i>i_m$ satisfying $g_i\notin\bigcup_{j\le m}F_{n(i_j)}g_{i_j}$. The resulting sequence covers all the $g_i$ (here we use $1_G\in F_n$), thus has multiplicity at least $k$. We can assume the multiplicity is realized at $1_G$ by applying an appropriate right translation to the sets.

In the other direction, given $k$ let $\{F_{n(i)}g_i\}$ be an incremental sequence of multiplicity $k$. Thus $g_i\notin F_{n(j)}g_j$ for all $i>j$; by symmetry $g_j\notin F_{n(j)}^{-1}g_i=F_{n(j)}g_i$. Since $n(j)\ge n(i)$ and the sets are increasing, this shows that $g_j\notin F_{n(i)}g_i$ also for $j<i$. Thus $g_j\notin F_{n(i)}g_i$ holds for all $i\neq j$, and the only sub-collection of $\{F_{n(i)}g_i\}$ that covers all the $g_i$ is the full sequence, whose multiplicity is $k$. Since $k$ was arbitrary, $\{F_n\}$ is not Besicovitch. □
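The greedy selection in the first half of the proof can be sketched as follows, again for $G=\mathbb{Z}$ with $F_n=\{-n,\ldots,n\}$. This is an illustration under those assumptions; the function name `greedy_incremental` and the sample family are hypothetical.

```python
def ball(n):
    """The symmetric interval F_n = {-n, ..., n} in Z."""
    return set(range(-n, n + 1))

def greedy_incremental(family):
    """Greedy selection from a list of pairs (n_i, g_i) with n_i non-increasing.

    A pair is kept exactly when its centre g_i is not yet covered by the
    translates F_{n} + g selected before it.
    """
    chosen, covered = [], set()
    for n, g in family:
        if g not in covered:
            chosen.append((n, g))
            covered |= {f + g for f in ball(n)}
    return chosen

family = [(3, 0), (2, 2), (2, 5), (1, 4), (1, 7)]
chosen = greedy_incremental(family)
print(chosen)

# Every centre of the original family is covered by the selected translates.
union = set().union(*({f + g for f in ball(n)} for n, g in chosen))
print(all(g in union for _, g in family))
```

Because each selected centre lies outside the previously selected translates, and the radii are non-increasing, the output is an incremental sequence in the sense defined above.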

Our main interest is in sequences for which the Besicovitch property fails. We 
require a slightly stronger property: 

Definition 2.3. $\{F_n\}$ is strongly non-Besicovitch if for every finite $E\subseteq G$ there is a finite set $E\subseteq E'\subseteq G$ with $1_G\in E'$ such that $\{E'F_n\}$ is not Besicovitch.

We note two situations where this property holds: first, when no sequence $F'_n\supseteq F_n$ is Besicovitch (take $E'=E\cup\{1_G\}$). Second, if $G$ is finitely generated, $B_n$ are balls, and no sub-sequence of balls $\{B_{n(i)}\}_{i=1}^{\infty}$ is Besicovitch then every sub-sequence is strongly non-Besicovitch; indeed given a finite set $E$ take $E'=B_m$ with $m$ large enough that $E\subseteq B_m$, so that $E'B_n=B_{n+m}$.

The sets we consider later will also satisfy the following property, which was already 
mentioned in the introduction: 
Definition 2.4. $\{F_n\}$ is almost central if for every $g\in G$ there is a finite set $E\subseteq G$ with $F_n g\subseteq EF_n$ for all $n$.

The two primary examples are when $G$ is abelian, in which case we can take $E=\{g\}$; and when $F_n=B_{r(n)}$ are balls, since then if $g\in B_m$ then $F_n g\subseteq B_mF_n$. The latter example shows that this property does not actually have any connection to abelianness of the group, since it holds for balls in any finitely generated group. It is clear that the definition is equivalent to the statement that for every finite $E\subseteq G$ there is a finite $E'\subseteq G$ with $F_nE\subseteq E'F_n$ for all $n$.

We now derive some properties of strongly non-Besicovitch and almost-central se- 
quences. 

Lemma 2.5. Suppose that $\{F_n\}$ is strongly non-Besicovitch. For every finite $E\subseteq G$ there is a finite $E'\subseteq G$ containing $E$, and such that for every finite $H\subseteq G$ there are arbitrarily long incremental sequences $\{F_{n(i)}g_i\}_{i=1}^{\ell}$ satisfying

(a) $Eg_i\cap F_{n(j)}g_j=\emptyset$ and $Eg_i\cap Eg_j=\emptyset$ for $i>j$.

(b) $Eg_i\cap H=\emptyset$ for all $i$.

(c) $(E')^{-1}\cap F_{n(i)}g_i\neq\emptyset$ for all $i$.

Proof. Let $E$ be given and assume without loss of generality that $1_G\in E$. Let $E'$ be associated to $E^{-1}E$ as in Definition 2.3. Let $H\subseteq G$ be finite, let $k=\ell+|E^{-1}H|$, and, using Proposition 2.2, choose an incremental sequence $\{E'F_{n(j)}g_j\}_{j=1}^{k}$ for $\{E'F_n\}$ with

$$1_G\in\bigcap_{j=1}^{k}E'F_{n(j)}g_j.$$

For $j<i$ we have $g_i\notin E'F_{n(j)}g_j$. Since $1_G\in E'$ this implies $g_i\notin F_{n(j)}g_j$, hence $\{F_{n(j)}g_j\}_{j=1}^{k}$ is incremental.

For (a), let $i>j$. Then $g_i\notin E'F_{n(j)}g_j$ and $E^{-1}\subseteq E'$ imply $Eg_i\cap F_{n(j)}g_j=\emptyset$; similarly $g_i\notin E'F_{n(j)}g_j$ and $1_G\in F_{n(j)}$, together with the definition of $E'$, give $Eg_i\cap Eg_j=\emptyset$.

For (b), note that since $1_G\in F_n$, by the incremental property, all the $g_j$ are distinct. $Eg_j\cap H\neq\emptyset$ implies $g_j\in E^{-1}H$, so after removing at most $|E^{-1}H|$ elements of the sequence we are left with an incremental sequence of length at least $\ell$ which, in addition to the above, satisfies (b).

(c) follows from $1_G\in E'F_{n(i)}g_i$. □

Lemma 2.6. Suppose that $\{F_n\}$ is strongly non-Besicovitch and almost central. Then for every finite $D,E\subseteq G$ there exists a finite $H\subseteq G$, with $D\subseteq H$, satisfying the following property: for every $\ell$ there is an incremental sequence $\{F_{n(i)}g_i\}_{i=1}^{\ell}$ such that

(i) $Eg_i\cap F_{n(j)}eg_j=\emptyset$ and $Eg_i\cap Eg_j=\emptyset$ for all $i\neq j$ and $e\in E$.

(ii) $Eg_i\cap H=\emptyset$ for all $i$.

(iii) $F_{n(i)}eg_i\cap H\neq\emptyset$ for every $i$ and $e\in E$.

Proof. Let $D,E$ be given, without loss of generality $1_G\in D\cap E$. Using almost centrality let $E'$ be such that $E'F_n\supseteq F_n(E\cup E^{-1})$, and assume $1_G\in E'$ (otherwise just add $1_G$ to it). Let $\widehat{E}=(E')^{-1}E$, let $E''$ be associated to $\widehat{E}$ as in the previous lemma, and apply the previous lemma to $H=D(E')^{-1}(E'')^{-1}$. We obtain arbitrarily long incremental sequences $\{F_{n(i)}g_i\}$ satisfying (a)-(c). Property (ii) is just (b).

For (i), we already know that $\widehat{E}g_i\cap F_{n(j)}g_j=\emptyset$ for $i>j$. Using the definition of $\widehat{E}$ this gives $Eg_i\cap F_{n(j)}Eg_j=\emptyset$ for $i>j$. This is the same as $F_{n(j)}^{-1}Eg_i\cap Eg_j=\emptyset$ for $i>j$, which, by symmetry of $F_{n(j)}$, is just $F_{n(j)}Eg_i\cap Eg_j=\emptyset$ for $i>j$. Thus this relation holds for all $i\neq j$. (i) follows using $1_G\in E\cap F_{n(j)}$.

For (iii), by choice of $E'$ for every $e\in E$ we have $F_{n(i)}e^{-1}\subseteq E'F_{n(i)}$, hence $F_{n(i)}\subseteq E'F_{n(i)}e$. Combined with (c) this implies that $(E'')^{-1}\cap E'F_{n(i)}eg_i\neq\emptyset$, hence $(E')^{-1}(E'')^{-1}\cap F_{n(i)}eg_i\neq\emptyset$. (iii) follows since $H\supseteq(E')^{-1}(E'')^{-1}$. □

2.2 Necessity 

In this section we add the assumption that the sets $F_n$ are symmetric, and continue to assume $1_G\in F_n$. It will be convenient to write $S_F(\varphi,x)$ and $R_F(\varphi,\psi,x)$ instead of $S_F(\varphi)(x)$, $R_F(\varphi,\psi)(x)$.

Theorem 2.7. If $\{F_n\}$ is strongly non-Besicovitch and almost central then there is an ergodic measure-preserving action of $G$ on a non-atomic measure space $(X,\mathcal{B},\mu)$, and functions $\varphi,\psi\in L^1(\mu)$ with $\int\psi\neq 0$, such that $R_{F_n}(\varphi,\psi)$ diverges a.e. as $n\to\infty$.

Theorem 1.1 is then a formal consequence of Theorem 2.7, since if $\{F_n\}$ is not strongly non-Besicovitch then $\{EF_n\}$ is Besicovitch for some finite set $E$.

The construction that is the proof of Theorem 2.7 proceeds by cutting and stacking. We give full details below, but let us first give an informal overview for readers familiar with the method. Suppose we have defined a large "stack" whose shape is a finite set $E\subseteq G$, and a pair of real-valued functions $\varphi,\psi\ge 0$ with $\|\varphi\|_1,\|\psi\|_1<\infty$, corresponding to an $\mathbb{R}$-coloring of $G_0$. Applying the corollary to $E$ and $D=E$ we obtain a set $H\supseteq E$, and, fixing a large $N$, an incremental sequence $\{F_{m(i)}\gamma_i\}_{i=1}^{N}$ with the associated properties. Now, cut the original stack into $N$ copies of equal mass, and translate them to $E\gamma_i$, $i=1,\ldots,N$, which by the corollary are pairwise disjoint and disjoint from $H$. Add new mass to the sites corresponding to $H$ (which is empty) in the new stack, and on it define $\varphi$ to take a very large negative value $v$, and define $\psi$ to be zero there. Also add new mass where necessary in $\bigcup_{i=1}^{N}F_{m(i)}E\gamma_i$, defining $\varphi,\psi$ to be $0$ there. If $v$ is negative enough in a manner depending only on the original stack, this forces the ratios over $F_{m(i)}e\gamma_i$ for $e\in E$ to be $<-1$; but the total change to $\|\varphi\|$ is $|H|v/N$, which can be made arbitrarily small by choosing $N$ large. Iterating this procedure, we can cause the ratios at the points corresponding to the original $E$ to fluctuate between $>1$ and $<-1$, and in the limit we obtain the desired counterexample.

We now carry this plan out in more detail. First we describe the cutting-and-stacking scheme in the group context. Fix in advance the measure space ($\mathbb{R}$, Lebesgue). We will define a compatible sequence of partial actions $T_n$. By this we mean that: (i) for every $g$ we define a sequence of maps $T_n^g$, $n=1,2,\ldots$ with increasing domains $X_{n,g}\subseteq\mathbb{R}$ and which extend each other in the sense that $T_{n+1}^g|_{X_{n,g}}=T_n^g$; (ii) for $x\in X$, if both the expressions $T_n^g(T_n^h x)$ and $T_n^{gh}x$ are well defined (that is, if $x\in X_{n,h}$, $T_n^h x\in X_{n,g}$, and $x\in X_{n,gh}$), then they are equal; and (iii) writing $X_n=\bigcup_{g\in G}X_{n,g}$, for every $x\in X_n$ and every $g\in G$ we have $x\in X_{m,g}$ for some $m\ge n$. It is clear that this defines in the limit an action of $G$ on $X=\bigcup_n X_n$ given by $T^g x=\lim_{n\to\infty}T_n^g x$. At the same time, we will define $\varphi_n,\psi_n:X_n\to\mathbb{R}$ in a compatible way, giving functions $\varphi,\psi:X\to\mathbb{R}$ in the limit.

The formulation above is somewhat unwieldy and the construction itself will take the following form. At each stage $n$ we will have defined a finite set $G_n\subseteq G$ and to each $g\in G_n$ associated an interval $I_{n,g}=[a_{n,g},b_{n,g})\subseteq\mathbb{R}$ of length $r_n>0$, independent of $g$, and with $I_{n,g}\cap I_{n,h}=\emptyset$ for $g\neq h$. For $x\in I_{n,g}$ the map $T_n^h$ is defined if $hg\in G_n$, in which case $T_n^h x\in I_{n,hg}$ is the point $a_{n,hg}+(x-a_{n,g})$ that occupies the same position in $I_{n,hg}$ as $x$ occupies in $I_{n,g}$. Thus $X_n=\bigcup_{g\in G_n}I_{n,g}$ and $X_{n,h}=\{x\in X_n : x\in I_{n,g}$ and $hg\in G_n\}$. What we have said so far ensures that (ii) is satisfied. To ensure properties (i) and (iii) we first describe the transition from stage $n$ to $n+1$, which is by "cutting and translating". Given $G_n$, $\{I_{n,g}\}_{g\in G_n}$ for some $n$, we first choose a large $N$ and choose elements $\gamma_1,\ldots,\gamma_N$ of $G$ such that the sets $G_n\gamma_i$ are pairwise disjoint. Now partition each interval $I_{n,g}$ into $N$ intervals of equal length

$$r_{n+1}=r_n/N.$$

Ordering these intervals from left to right, set $I_{n+1,g\gamma_i}$ to be the $i$-th sub-interval.

We have so far defined intervals for $g\in G'_{n+1}=\bigcup_i G_n\gamma_i$ and one may verify that the compatibility condition (i) holds. To ensure (iii), fix a sequence $\Gamma_n\subseteq G$ of finite subsets increasing to $G$ with $1_G\in\Gamma_1$, and define $G_{n+1}$ to be any finite set containing $\Gamma_nG'_{n+1}$; to the new points $g\in G_{n+1}\setminus G'_{n+1}$ assign arbitrary pairwise disjoint intervals $I_{n+1,g}\subseteq\mathbb{R}\setminus X_n$.
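One "cutting and translating" step can be made concrete for $G=\mathbb{Z}$, written additively. The sketch below is illustrative only: the dictionaries mapping a group element to the pair (left endpoint, length) of its interval, and the names `cut_and_translate` and `act`, are assumptions made here and do not come from the paper.

```python
from fractions import Fraction

def cut_and_translate(intervals, gammas):
    """Cut each interval I_g into N = len(gammas) equal pieces and assign the
    i-th piece to the element g + gamma_i."""
    N = len(gammas)
    new = {}
    for g, (a, r) in intervals.items():
        for i, gamma in enumerate(gammas):
            h = g + gamma
            if h in new:
                raise ValueError("the translates G_n + gamma_i must be pairwise disjoint")
            new[h] = (a + i * r / N, r / N)
    return new

def act(intervals, h, x):
    """Partial action: move x from I_g to the same relative position in I_{g+h},
    or return None when g + h carries no interval at this stage."""
    for g, (a, r) in intervals.items():
        if a <= x < a + r:
            if g + h in intervals:
                return intervals[g + h][0] + (x - a)
            return None
    return None

# Stage n: G_n = {0, 1} with I_0 = [0, 1) and I_1 = [1, 2).
stage_n = {0: (Fraction(0), Fraction(1)), 1: (Fraction(1), Fraction(1))}
# Stage n + 1: three copies, translated by 0, 2 and 4.
stage_n1 = cut_and_translate(stage_n, [0, 2, 4])

x = Fraction(1, 4)
print(act(stage_n, 1, x), act(stage_n1, 1, x))   # the new map extends the old one
print(act(stage_n, 2, x), act(stage_n1, 2, x))   # the domain has grown
```

The first printed pair illustrates the compatibility condition (i): where the old map was defined, the new one agrees with it. The second pair shows a translation that becomes defined only after the cut.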

We will define by induction $G_n$, $\{I_{n,g}\}_{g\in G_n}$ as above with associated partial action $T_n$, and bounded functions $\varphi,\psi:X_n\to\mathbb{R}$ with $\|\varphi\|_1,\|\psi\|_1<2$. Furthermore we will have bounded functions $i_n:X_{n-1}\to\mathbb{N}$ such that for every $x\in[0,1]$ the maps $T_n^g x$ are defined for all $g\in F_{i_n(x)}$, and

$$R_{F_{i_n(x)}}(\varphi,\psi,x)\ \begin{cases}>1 & n\text{ odd}\\ <-1 & n\text{ even}\end{cases}\qquad(1)$$

where we define $R_n(\varphi,\psi)$ as before in terms of the partial action $T_n$. We also will ensure that $i_n(x)\to\infty$ for $x\in\bigcup_k X_k$. Assuming all this, it is clear that, for the action $T$ defined in the limit, for every $k$ the ratios $R_n(\varphi,\psi,x)$ diverge for $x\in X_k$, and hence diverge everywhere on $X=\bigcup_n X_n$. One point we have not touched on is ergodicity of the limit action; we will come back to this below.

It remains to describe the construction. At the first step we set $G_1=\{1_G\}$, $I_{1,1_G}=[0,1]$, so $X_1=[0,1]$; define $\varphi,\psi$ and $i_1$ to be identically 1. Then all the requisite properties hold.

Now suppose for some $n$ we have defined $G_n$, $\{I_{n,g}\}_{g\in G_n}$, $r_n$, $\varphi,\psi$, and $i_n$ as above. For simplicity we assume $n$ is even, the odd case being the same. Let $i_n^*=\sup_{x\in X_{n-1}}i_n(x)<\infty$ and

$$\Phi_n=\sup_{x\in[0,1]}|S_{F_{i_n(x)}}(\varphi,x)|,\qquad\Psi_n=\sup_{x\in[0,1]}|S_{F_{i_n(x)}}(\psi,x)|,$$

and choose

$$v=\Phi_n+\Psi_n$$

so that $(v-\Phi_n)/\Psi_n=1$.

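Let $H$ be the set associated to $D=E=G_n$ in Lemma 2.6. Choose $N$ large enough that

$$|H|\cdot v\cdot r_n/N<2-\int_{X_n}|\varphi|.$$

Applying the lemma, choose elements $\gamma_1,\ldots,\gamma_N$ and indices $k_1,\ldots,k_N>i_n^*$ such that

$$H\cap F_{k_j}g\gamma_j\neq\emptyset\quad\text{for all }j\text{ and }g\in G_n\qquad(2)$$

$$H\cap G_n\gamma_j=\emptyset\quad\text{for all }j\qquad(3)$$

$$G_n\gamma_i\cap F_{k_j}g\gamma_j=\emptyset\quad\text{for }i\neq j\text{ and }g\in G_n\qquad(4)$$

$$G_n\gamma_i\cap G_n\gamma_j=\emptyset\quad\text{for }i\neq j\qquad(5)$$

Let

$$G_{n+1}=\Gamma_{n+1}\Big[\Big(\bigcup_{j=1}^{N}F_{k_j}G_n\gamma_j\Big)\cup H\Big].$$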
Assign intervals of length $r_{n+1}=r_n/N$ to the elements of $G_{n+1}$ as follows: First partition each $I_{n,g}$ into $N$ intervals of length $r_{n+1}$ and for $g\in G_n$ assign to $h=g\gamma_j$ the $j$-th sub-interval of $I_{n,g}$, which we call $I_{n+1,h}$. So far there are no conflicts by (5) and the assignment consists of disjoint intervals. To the remaining elements $h\in G_{n+1}\setminus\bigcup_{j=1}^{N}G_n\gamma_j$ associate arbitrary pairwise disjoint intervals $I_{n+1,h}\subseteq\mathbb{R}\setminus X_n$, ensuring that the entire family $\{I_{n+1,g}\}_{g\in G_{n+1}}$ is pairwise disjoint. This can easily be done since $X_n\subseteq\mathbb{R}$ is bounded.

For x G X n n/ nj97j for some g G G n , define z n+ i(x) = fcj. Again, this is well defined 
by ©. 

On (J/igi? In+i,h set ip = and (p = v. There are no conflicts with previous 
definitions because of 

On the remaining mass, define ip = ip = for h £ H. There are no conflicts by @. 

Finally, in order to verify that R{ r x \(ip,ifi,x) > 1 for x G A n _i, note that, by 
dU, if x G I n ,g-yj f° r 9 £ G n then 

hence, by choice of v and ([2]), we have -Rf. ^(ip,ifj,x) > 1. 

While this construction ensures that the ratios diverge on a positive fraction of 
the mass of a positive fraction of the ergodic component of the action, these ergodic 
components may, a-priori, be atomic, whereas we require divergence of the ratios on a 
non-atomic space. The easiest solution is to introduce an intermediate step between 
the stages of the construction, during which we create a large stack with disjoint but 
very randomly placed copies of the previous stacks. It is standard to show that the 
resulting action is ergodic, and the new intermediate steps do not interfere with the 
construction above. We omit the details. 

2.3 Balls in finitely generated groups. 

Let $G$ be finitely generated and $B_n$ the balls with respect to some symmetric generating set. These are symmetric and contain the identity and, as noted in the introduction, $\{B_n\}$ are almost central. As noted in Section 2.1, if no infinite subsequence $\{B_{n(i)}\}_{i=1}^{\infty}$ is Besicovitch then every such subsequence is strongly non-Besicovitch. This and Theorem 2.7 imply Theorem 1.3.

The application to the Heisenberg group mentioned after Theorem 1.3 follows from:






Proposition 2.8. Let

$$G=\left\{\begin{pmatrix}1&x&z\\0&1&y\\0&0&1\end{pmatrix} : x,y,z\in\mathbb{Z}\right\}$$

denote the discrete Heisenberg group with generating set

$$\{a^{\pm},b^{\pm}\}=\left\{\begin{pmatrix}1&\pm1&0\\0&1&0\\0&0&1\end{pmatrix},\begin{pmatrix}1&0&0\\0&1&\pm1\\0&0&1\end{pmatrix}\right\}.$$

Let $\{B_n\}$ be the associated sequence of balls. Then every infinite subsequence $\{B_{n(i)}\}$ is non-Besicovitch.



Proof. Let 



6" 1 o~ 1 6o 



and 



{C n }n 



{( 1 





(\ 
1 

\ o o 



m 



1 



m G 



I V o o i 

is the center of G. Using the commutation relation [6, a] = c _1 it is elementary to show 
that the set M r = {m : c m £ B^ r } contains gaps that grow arbitrarily large as r — > oo. 
Thus we can choose < s r ,t r £ M r such that (s r ,t r ) n M r = and t r — s r — > oo. 

Sru+n > r(i) 2 ), then 



If a sequence r(i) that grows quickly enough (e.g. if t 



r(i+l) ~ b r(i+l) 

is an incremental sequence whose elements all contain Iq- This proves 
the claim. □ 

Theorem 1.3 can be slightly strengthened using the following version of Theorem 2.7.

Theorem 2.9. In Theorem 2.7, if in addition $F_n=B_{r(n)}$ are balls in a group, then in the conclusion we may also assume that $\varphi,\psi\in L^\infty(\mu)$.

Proof. In the $n$-th stage of the construction, instead of setting $\varphi=v$ only on the intervals associated to $h\in H$, choose an appropriate $m$ and set $\varphi=v/|B_{m/3}|$ on the intervals associated to $h\in B_m$. This $m$ is chosen before $N$ and the $\gamma_i$, and we can ensure that $G_n\gamma_i\cap B_m=\emptyset$ as in Lemma 2.6 by simply choosing an a priori larger $N$ and discarding some of its elements, so this does not interfere with the construction. By choosing $N$ large relative to $m$, the $L^1$-norm of $\varphi$ still increases arbitrarily little. Now, since $H\cap F_{k_j}g\gamma_j\neq\emptyset$ for $g\in G_n$ and we can assume that $m$ is large enough that $H\subseteq B_{m/3}$, there is a ball $B_{m/3}\gamma'_j\subseteq B_m\cap F_{k_j}g\gamma_j$ for some $\gamma'_j$ (take $\gamma'_j$ to be the point on the midpoint geodesic from $\gamma_j$ to some element of $H\cap F_{k_j}g\gamma_j$). The proof now carries through. □

Finally, for finitely generated non-abelian free groups it is elementary that no se- 
quence of balls in the standard generator set is Besicovitch. We omit the proof. We 
do not know if this is true for every generating set, though it seems very likely that it 
is. 

2.4 Some general reductions 

We give here some simple reductions that will be used later. Write $L^1_+(\mu)=\{f\in L^1(\mu) : f\ge 0\}$.

Lemma 2.10. Let $F_n\subseteq G$. The ratio ergodic theorem holds along $\{F_n\}$ if and only if for every action of $G$ there is a $0\neq\psi\in L^1_+$, such that $R_{F_n}(\varphi,\psi)\to\int\varphi/\int\psi$ for all $\varphi\in L^1_+$.

Proof. One direction is obvious. For the other fix an action and suppose that there is a $\psi$ as above. Convergence of $R_{F_n}(\varphi,\psi)$ for $\varphi\in L^1_+$ implies it for all $\varphi\in L^1$, since the operators $R_{F_n}(\cdot,\psi)$ are linear and one can break an arbitrary $L^1$ function into a difference of non-negative ones. Now for any $\varphi,\theta\in L^1$ with $\int\theta\neq 0$, the conclusion follows by passing to the limit in the identity $R_{F_n}(\varphi,\theta)=R_{F_n}(\varphi,\psi)/R_{F_n}(\theta,\psi)$. □
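The two algebraic facts used in this proof, linearity of $R_F(\cdot,\psi)$ and the identity $R_F(\varphi,\theta)=R_F(\varphi,\psi)/R_F(\theta,\psi)$, can be checked on a toy example. In the sketch below the "orbit" is simply a list of points standing in for $\{T^g x : g\in F\}$, and the functions `phi`, `theta`, `psi` are arbitrary choices made for illustration.

```python
from fractions import Fraction

def S(f, orbit):
    """Ergodic sum of f over the finite list of orbit points."""
    return sum(f(y) for y in orbit)

def R(f, h, orbit):
    """Ratio of ergodic sums S(f) / S(h), computed exactly."""
    return Fraction(S(f, orbit), S(h, orbit))

orbit = [0, 1, 2, 3, 4]           # stand-in for the points T^g x, g in F
phi = lambda y: y * y
theta = lambda y: 1 + y % 3
psi = lambda y: 1

# Quotient identity from the proof of Lemma 2.10.
print(R(phi, theta, orbit) == R(phi, psi, orbit) / R(theta, psi, orbit))

# Linearity in the first argument.
diff = lambda y: phi(y) - theta(y)
print(R(diff, psi, orbit) == R(phi, psi, orbit) - R(theta, psi, orbit))
```

Exact rational arithmetic is used so that the identities hold with equality rather than up to rounding.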

Lemma 2.11. There exists a sequence $F_n\subseteq G$ along which the ratio ergodic theorem holds if and only if there exists such a sequence with, in addition, $1_G\in F_n$ for all $n$.

Proof. Only the "only if" direction must be proved. Suppose the ratio ergodic theorem holds along $\{F_n\}$. First suppose some element $g\in G$ belongs to infinitely many $F_n$. Let $F_{n(i)}$ be the infinite subsequence of sets containing $g$ and write $F'_i=F_{n(i)}g^{-1}$, so that $1_G\in F'_i$, and the ratio ergodic theorem holds along $\{F'_i\}$ because of the identity

$$R_{F'_i}(\varphi,\psi)(T^g x)=R_{F_{n(i)}}(\varphi,\psi)(x).$$

It remains to deal with the case that no $g$ is in infinitely many $F_n$. In this case, by passing to a subsequence, we can assume that the sets $F_n$ are pairwise disjoint. We shall show that the ratio ergodic theorem holds along $E_n=\bigcup_{i\le n}F_i$. Since any $g\in E_1$ belongs to all of the $E_n$, this brings us back to the first case that was already established.

Thus, consider an action of $G$ on a non-atomic measure space $(X,\mathcal{B},\mu)$ and $\varphi,\psi\in L^1_+(\mu)$. Since $\{F_n\}$ are pairwise disjoint, $S_{E_n}(\varphi)=\sum_{k=1}^{n}S_{F_k}(\varphi)$, hence

$$R_{E_n}(\varphi,\psi)=\sum_{k=1}^{n}\frac{S_{F_k}(\psi)}{S_{E_n}(\psi)}\,R_{F_k}(\varphi,\psi).$$

Since $R_{F_k}(\varphi,\psi)\to\int\varphi/\int\psi$, we will be done if we show that $S_{E_n}(\psi)\to\infty$. To see this choose $A\subseteq X$ and $\varepsilon>0$ with $\mu(A)>0$ and $\psi\ge\varepsilon 1_A$. Choose another set $B\subseteq X$ with $\mu(B)/\mu(A)$ irrational. Then $R_{F_k}(1_B,1_A)$ are rational and converge a.e. to the irrational number $\mu(B)/\mu(A)$, so their denominators, which are $S_{F_k}(1_A)$, a.s. tend to $\infty$ with $k$. Hence $S_{F_n}(\psi)\ge\varepsilon S_{F_n}(1_A)\to\infty$, concluding the proof.

We note that if $\bigcup F_n=G$ then $S_{E_n}(\psi)\to\infty$ follows directly from conservativity. □

Note that if we only assume the ratios to converge a.e. but not necessarily to the limit $\int\varphi/\int\psi$, then the argument above still works assuming that $S_{E_n}(\psi)\to\infty$ for $0\neq\psi\in L^1_+$. This is the case if $\bigcup F_n=G$, for example, because ergodicity on a non-atomic space is the same as conservativity.

We say that {F n } is generating if |J F n generates G as a group. 

Lemma 2.12. If $\{F_n\}$ does not generate then the ratio ergodic theorem fails along $\{F_n\}$.

Proof. Suppose $F_n$ lie in a proper subgroup $H<G$ and consider an ergodic action of $G$ whose restriction to $H$ is non-ergodic (e.g. a product measure on $\{0,1\}^G$ with the shift action). Let $\mathcal{I}$ be the $\sigma$-algebra of $H$-invariant sets and choose functions $\varphi,\psi\in L^1$ that are constant on the atoms of $\mathcal{I}$ but $\mathbb{E}(\varphi|\mathcal{I})/\mathbb{E}(\psi|\mathcal{I})$ is not constant. Clearly $R_{F_n}(\varphi,\psi)=\mathbb{E}(\varphi|\mathcal{I})/\mathbb{E}(\psi|\mathcal{I})$ for all $n$, so $R_{F_n}(\varphi,\psi)\not\to\int\varphi/\int\psi$. □

2.5 The group Z°° 

We now turn to $G=\mathbb{Z}^\infty$ and the proof of Theorem 1.2, switching to additive notation. The main ingredient is Proposition 2.7 from [6]. In that paper the Besicovitch property is called incompressibility ([6, Definition 1.9]); the two notions are the same by Proposition 2.2.

Proposition 2.13. If $F_n\subseteq\mathbb{Z}^\infty$ are finite sets and $\{F_n\}$ generates, then $\{F_n\}$ is not Besicovitch.

Combined with the fact that any sequence in an abelian group is almost central, the proposition above and Theorem 2.7 immediately imply that the ratio ergodic theorem fails along every generating symmetric sequence $F_n\subseteq\mathbb{Z}^\infty$ with $0\in F_n$. We now show that the symmetry assumption is not necessary:

Proposition 2.14. The ratio ergodic theorem fails along any generating sequence $F_n\subseteq\mathbb{Z}^\infty$ with $0\in F_n$.

Proof. Suppose that $\{F_n\}$ satisfies the ratio ergodic theorem and $0\in F_n$. Then the same is true for $\{-F_n\}$. To see this, given an action $\{T^g\}_{g\in\mathbb{Z}^\infty}$ define an action $\widetilde{T}^g=T^{-g}$ (this is an action because $\mathbb{Z}^\infty$ is abelian). Then $\sum_{g\in-F_n}\varphi(T^g x)=\sum_{g\in F_n}\varphi(\widetilde{T}^g x)$, and so the ratios over $-F_n$ with respect to $T$ are the same as the ratios over $F_n$ with respect to $\widetilde{T}$, and so converge as required.

Now let $E_n=F_n\cup(-F_n)$, which is a symmetric sequence with $0\in E_n$. Define a probability measure $\nu_n$ on $E_n$ by

$$\nu_n=\frac{1}{2|F_n|}\left(1_{F_n}+1_{-F_n}\right).$$

For a finitely supported probability measure $\nu$ on $G$, let

$$S_\nu(\varphi)(x)=\sum_{g\in G}\nu(g)\,\varphi(T^g x)$$

and define $R_\nu(\varphi,\psi)=S_\nu(\varphi)/S_\nu(\psi)$. Then for any action and $\varphi,\psi$ as in the ratio ergodic theorem,

$$R_{\nu_n}(\varphi,\psi)\to\frac{\int\varphi\,d\mu}{\int\psi\,d\mu}\qquad\mu\text{-a.s.}$$

While $R_{\nu_n}\neq R_{E_n}$, on non-negative functions the two differ by at most a multiplicative constant of 4, since $\nu_n$ is equivalent to the uniform measure $u_n$ on $E_n$ with Radon-Nikodym derivative between $1/2$ and $2$, and $R_{E_n}=R_{u_n}$. Thus, if there is an action and functions $\varphi,\psi\in L^1_+$, $\int\psi\neq 0$, such that $R_{E_n}(\varphi,\psi)$ fluctuates wildly enough (e.g. $\limsup/\liminf>4$), then we have a contradiction to the convergence of $R_{\nu_n}$. Since $\{E_n\}$ is symmetric, contains $0$, and is almost central and strongly non-Besicovitch (see discussion preceding this proposition), such an action and pair of functions can be constructed using exactly the same scheme as in the proof of Theorem 2.7. We omit the details. □

Now, we have already seen that if there is some sequence along which the ratio ergodic theorem holds then there is also such a sequence that contains $0$ (Lemma 2.11) and generates (Lemma 2.12). With these facts in hand, the proposition above proves Theorem 1.2.
3 Groups of polynomial growth



In this section we prove the ratio ergodic theorem "in density" for groups of polynomial growth (Theorem 1.4). After defining the sequence $F_n\subseteq G$ in Section 3.1, the proof follows the standard two-step scheme: in Section 3.2 we prove, for fixed $\psi$, that $R_{F_n}(\varphi,\psi)$ converges to the proper limit on a dense family of functions $\varphi\in L^1$ (a "Chacon-Ornstein lemma"), and in Section 3.3 we extend to all $\varphi\in L^1$ using a suitable maximal inequality. Both parts use growth properties of $G$ in an essential way.

3.1 The averaging sequence 

Let $G$ be a group of polynomial growth and $B_n = A^n$ the balls with respect to some symmetric generating set $A$. By Gromov's theorem [5], $G$ is virtually nilpotent, and a theorem of Bass [1] implies that there are constants $c_1, c_2, c$ (moreover, with $c \in \mathbb{N}$) such that

\[ c_1 n^c \le |B_n| \le c_2 n^c. \]

Define the $k$-boundary of $B_n$ to be

\[ \partial_k B_n = B_{n+k} \setminus B_{n-k}. \]

We remark that it is easy to show that $\{B_n\}$ is a Følner sequence, but we will not use this fact.
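As a concrete illustration of the growth bounds and the $k$-boundary, the following minimal sketch takes $G = \mathbb{Z}^2$ with the standard symmetric generators. This example is chosen here for illustration and is not singled out in the text. It assumes that $B_n$ consists of all words of length at most $n$, which agrees with $A^n$ when the identity is included in $A$; the names `GENS`, `ball` and `k_boundary` are hypothetical.

```python
# Assumption: G = Z^2 with generators (+-1, 0), (0, +-1); B_n = words of length <= n.
GENS = [(1, 0), (-1, 0), (0, 1), (0, -1)]

def ball(n, gens=GENS):
    """Return B_n as a set of lattice points, grown one generator step at a time."""
    points = {(0, 0)}
    for _ in range(n):
        points |= {(x + a, y + b) for (x, y) in points for (a, b) in gens}
    return points

def k_boundary(n, k, gens=GENS):
    """Return the k-boundary of B_n, i.e. B_{n+k} minus B_{n-k} (requires n >= k)."""
    return ball(n + k, gens) - ball(n - k, gens)

if __name__ == "__main__":
    for n in (1, 2, 4, 8):
        size = len(ball(n))
        # In this example |B_n| = 2n^2 + 2n + 1, so c = 2 with c_1 = 2, c_2 = 5 for n >= 1.
        print(n, size, 2 * n * n <= size <= 5 * n * n, len(k_boundary(n, 1)))
```

In this example $|\partial_1 B_n| = 8n + 4$, which is small compared with $|B_n|$; this is the Følner-type behaviour mentioned in the remark above.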

We now define the subsequence $F_i = B_{n(i)}$ for which we will prove Theorem 1.4 (the construction below can be perturbed in many ways to get a large class of other such sequences). Let

\[ J_m = [2^{m-1}, 2^m) \cap \mathbb{Z}. \]

We define the index sequence $n(i)$ for $i \in J_m$, recursively in $m = 1, 2, 3, \dots$. For $m = 1$ set $n(1) = 1$. Now assume we have defined $n(i)$ for $i \in \bigcup_{k<m} J_k$. Let

\[ N(m) = n(2^{m-1} - 1), \]

which is the largest value of $n(i)$ defined so far, and set $\{n(i)\}_{i\in J_m}$ to be the arithmetic sequence with $|J_m|$ terms and gap $3N(m)$, starting at

\[ L(m) = |J_m| \cdot 3N(m). \]

Thus $n(2^{m-1} + i) = L(m) + i \cdot 3N(m)$ for $0 \le i < 2^{m-1}$.

Note that $\{n(i)\}_{i\in J_m} \subseteq [L(m), 2L(m))$, hence $N(m) < 2L(m-1)$, and, since by the last equation $L(m-1) = 2^{m-2} \cdot 3N(m-1)$, we deduce that

\[ N(m) \le (6 \cdot 2^{m-1})^m. \]
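The recursive definition of the index sequence can be sketched as follows. This is a minimal illustration, assuming only the rules stated above ($n(1) = 1$, $N(m) = n(2^{m-1}-1)$, $L(m) = |J_m| \cdot 3N(m)$, gap $3N(m)$); the function name `index_sequence` and the choice to start $N$ and $L$ at $m = 2$ are assumptions made here.

```python
def index_sequence(max_m):
    """Build n(i) for 1 <= i < 2**max_m, together with N(m) and L(m) for 2 <= m <= max_m."""
    n = {1: 1}          # block J_1 = {1}
    N, L = {}, {}
    for m in range(2, max_m + 1):
        block = range(2 ** (m - 1), 2 ** m)      # J_m = [2^(m-1), 2^m)
        N[m] = n[2 ** (m - 1) - 1]               # largest index value defined so far
        L[m] = len(block) * 3 * N[m]             # starting point of the new block
        for offset, i in enumerate(block):
            n[i] = L[m] + offset * 3 * N[m]      # arithmetic progression with gap 3N(m)
    return n, N, L

if __name__ == "__main__":
    n, N, L = index_sequence(4)
    print([n[i] for i in sorted(n)])
    for m in sorted(N):
        in_range = all(L[m] <= n[i] < 2 * L[m] for i in range(2 ** (m - 1), 2 ** m))
        print(m, N[m], L[m], in_range, N[m] <= (6 * 2 ** (m - 1)) ** m)
```

The first values are $n(1), \dots, n(7) = 1, 6, 9, 108, 135, 162, 189$, which shows how quickly the gaps between consecutive blocks grow.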
Having defined $F_i = B_{n(i)}$, for $i \in J_m$ set

\[ F_i^+ = B_{n(i)+N(m)} \]

and

\[ \partial^* F_i = \partial_{N(m)} F_i. \]

Notice that $F_{i-1}^+ \cup \partial^* F_i \subseteq F_i^+$ and $\partial^* F_i \cap F_{i-1}^+ = \emptyset$.

3.2 Convergence on a dense subset of $L^1$

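For the rest of the section, fix an ergodic measure-preserving action of $G$ on a $\sigma$-finite measure space $(X,\mathcal{B},\mu)$. Given $\varphi : X \to [0,\infty)$ write

\[ \varphi_i(x) = \frac{S_{\partial^* F_i}(\varphi,x)}{S_{F_{i-1}^+}(\varphi,x)} \tag{6} \]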

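Lemma 3.1. Let $\varphi$ be as above and $x \in X$. Given $\varepsilon,\delta > 0$ suppose that $N \in J_m$ and $U \subseteq \{1,\dots,N\}$ are such that $|U|/N > \delta$ and $\varphi_i(x) > \varepsilon$ for $i \in U$. Then, assuming $m$ is large enough in a manner depending only on $\varepsilon,\delta$,

\[ S_{F_N^+}(\varphi,x) \ge |F_N^+|^2\,\varphi(x). \]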

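Proof. We suppress $x$ in our notation. For $i \in U$ we have by definition that $S_{\partial^* F_i}(\varphi) > \varepsilon S_{F_{i-1}^+}(\varphi)$ and hence $S_{F_i^+}(\varphi) \ge (1+\varepsilon) S_{F_{i-1}^+}(\varphi)$. Since $\varphi \ge 0$, for any $j < i$ we have $S_{F_i^+}(\varphi) \ge S_{F_j^+}(\varphi)$. Starting from $S_{F_N^+}(\varphi)$ and applying this recursively to the elements of $U$ in reverse order, we have

\begin{align*}
S_{F_N^+}(\varphi) &\ge (1+\varepsilon)^{|U|-1} S_{F_1^+}(\varphi) \\
&\ge (1+\varepsilon)^{\delta N-1}\varphi \\
&\ge (1+\varepsilon)^{\delta 2^{m-1}-1}\varphi.
\end{align*}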

It remains to notice that

\[ |F_N^+| \le |B_{2n(N)}| \le C\,n(N)^c \le C\,N(m+1)^c \le C\,(2^m)^{cm} = C\,2^{cm^2} \]

for some constant $C$ depending only on $c_2$, and that if $m$ is large in a manner depending on $\varepsilon,\delta$ then $C^2 2^{2cm^2} < (1+\varepsilon)^{\delta 2^{m-1}-1}$. □
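To get a feeling for how large $m$ must be, one can compare the two sides of the final inequality, $C^2 2^{2cm^2} < (1+\varepsilon)^{\delta 2^{m-1}-1}$, on a logarithmic scale: the left side grows like $2^{2cm^2}$, while the exponent on the right grows like $2^{m-1}$. The sketch below is only an illustration; the function name `smallest_m`, the search cap `m_max` and the sample parameters are assumptions made here, and it reports the first $m$ at which the inequality holds rather than proving that it holds from then on.

```python
import math

def smallest_m(eps, delta, c, C, m_max=200):
    """Return the first m <= m_max with C^2 * 2^(2*c*m^2) < (1+eps)^(delta*2^(m-1) - 1).

    Both sides are compared after taking log base 2. Returns None if no such m is found.
    """
    log_growth = math.log2(1 + eps)
    for m in range(1, m_max + 1):
        lhs = 2 * math.log2(C) + 2 * c * m * m          # log2 of C^2 * 2^(2 c m^2)
        rhs = (delta * 2 ** (m - 1) - 1) * log_growth   # log2 of (1+eps)^(delta 2^(m-1) - 1)
        if lhs < rhs:
            return m
    return None

if __name__ == "__main__":
    # Hypothetical sample values: eps = 0.5, delta = 0.1, growth degree c = 2, constant C = 1.
    print(smallest_m(0.5, 0.1, 2, 1.0))
```

Smaller values of $\varepsilon$ or $\delta$ push the threshold up, but only logarithmically, since the right-hand exponent is exponential in $m$.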

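Theorem 3.2. Let $\varphi \in L^1(\mu)$ with $\varphi \ne 0$ and $\varphi \ge 0$. Then $\varphi_i \xrightarrow{d} 0$ $\nu$-a.e., where $d\nu = \varphi\,d\mu$.

Proof. Fix $\varepsilon > 0$. It suffices to show that

\[ d(i : \varphi_i(x) > \varepsilon) = 0 \quad \nu\text{-a.e.} \]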
Fix $\delta > 0$, which we suppress in our notation, and let

\[ E_N = \left\{ x \in X : \delta < \varphi(x) < \delta^{-1} \text{ and } \varphi_i(x) > \varepsilon \text{ for at least a } \delta\text{-fraction of } 1 \le i \le N \right\}. \]

It is enough to show, for every $\delta > 0$, that $\nu$-a.e. $x$ belongs to only finitely many $E_N$.

We establish the last claim. Assume, as we may, that $m$ is large relative to $\varepsilon,\delta$ as in the previous lemma. Let $N \in J_m$. By invariance of $\mu$, we have



\begin{align*}
|F_N| \cdot \nu(E_N) &= \int \sum_{g\in F_N} 1_{E_N}(T^g x)\,\varphi(T^g x)\,d\mu(x) \\
&= \int S_{F_N}(\varphi \cdot 1_{E_N})\,d\mu(x) \tag{7}
\end{align*}

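Suppose that $g \in F_N$ is such that $T^g x \in E_N$. By the previous lemma (applied to $\varphi \circ T^g$) and the definition of $E_N$,

\[ S_{F_N^+}(\varphi)(T^g x) \ge |F_N^+|^2 \cdot \varphi(T^g x) > |F_N^+|^2 \cdot \delta. \]

Since $hg \in (F_N^+)^2$ for every $h \in F_N^+$, we have shown that if $N \in J_m$ then

\[ S_{F_N}(\varphi \cdot 1_{E_N})(x) > 0 \implies S_{(F_N^+)^2}(\varphi)(x) > \delta |F_N^+|^2. \]

By definition of $E_N$ we have $\varphi(T^g y) < \delta^{-1}$ if $1_{E_N}(T^g y) \ne 0$. Therefore $S_{F_N}(\varphi \cdot 1_{E_N}) \le |F_N| \cdot \delta^{-1}$, so, by the implication above,

\[ S_{F_N}(\varphi \cdot 1_{E_N}) \le |F_N| \cdot \delta^{-1} \cdot 1_{\{S_{(F_N^+)^2}(\varphi) > \delta |F_N^+|^2\}}. \]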
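Integrating this $d\mu$ and using (7) and Markov's inequality,

\begin{align*}
|F_N|\,\nu(E_N) &\le |F_N| \cdot \delta^{-1} \cdot \mu\left(x : S_{(F_N^+)^2}(\varphi) > \delta |F_N^+|^2\right) \\
&\le |F_N| \cdot \delta^{-2}\left(|F_N^+|^{-2} \cdot \int S_{(F_N^+)^2}(\varphi)\,d\mu\right) \\
&= \delta^{-2} \cdot |F_N| \cdot |F_N^+|^{-2} \cdot |(F_N^+)^2| \cdot \int \varphi\,d\mu
\end{align*}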



Now by polynomial growth and the fact that $(F_N^+)^2 \subseteq B_{4n(N)}$ we have

\[ |(F_N^+)^2| \le C \cdot |F_N| \]

for a constant $C$ depending on $c_1, c_2, c$, but not on $m$. Thus, we have shown

, „ . C fipdu C f ipdu 



using the trivial bound $|F_N| \ge N$. This is summable, so by Borel-Cantelli, $\nu$-a.e. $x$ belongs to finitely many $E_N$. □

Recall that a co-boundary is a function of the form $\varphi = \tau - \tau^g$ for some $g \in G$. It is said to be an $L^1$-co-boundary if $\tau \in L^1(\mu)$, and positive if $\tau \ge 0$. As a consequence of the theorem above we obtain a Chacon-Ornstein type statement. Note that in what follows, $\sigma$-finiteness ensures that statements about strictly positive $L^1$-functions are not vacuous.

Corollary 3.3. Let r G with r > and Then for every g G G, 

\[ \operatorname*{d-lim}_{i\to\infty} R_{F_i}(\tau - \tau^g, \tau) = 0 \quad \mu\text{-a.e.} \]

Proof. There is an $i_0$ such that $g \in F_{i_0}$, and for $i > i_0$ we have



|-R f .(t-tV)| < 



< 



S L 



where $\tau_i$ is defined as in (6), and we have used $\tau > 0$. From the theorem we conclude that $R_{F_i}(\tau - \tau^g, \tau) \xrightarrow{d} 0$ at $\tau\,d\mu$-a.e. point. Since $\tau > 0$ the measures $\mu$ and $\tau\,d\mu$ are equivalent, and the corollary follows. □

The next conclusion is standard from the previous one. 

Proposition 3.4. Given $0 < \psi \in L^1(\mu)$, the set of $\varphi \in L^1(\mu)$ such that $R_{F_i}(\varphi,\psi) \xrightarrow{d} \int\varphi / \int\psi$ a.e. is dense in $L^1$.
Proof. Let us say that $\tau \in L^1$ is $\psi$-dominated if $0 \le \tau \le M\psi$ for some $M = M(\tau)$. We claim, first, that the convergence in the statement holds for $\varphi = \tau - \tau^g$ where $g \in G$ and $\tau \in L^1$ is $\psi$-dominated; and second, that the joint linear span of $\psi$ and the set of such $\varphi$ is dense in $L^1$. The two claims prove the proposition since the limit in question holds trivially when $\varphi = \psi$ and the operators $R_{F_i}(\cdot,\psi)$ are linear.

For the first statement, let $\varphi = \tau - \tau^g$ with $0 < \tau \le M\psi$. Then $|R_{F_n}(\tau,\psi)| \le M$, hence by the previous corollary,

\[ R_{F_i}(\tau - \tau^g, \psi) = R_{F_i}(\tau - \tau^g, \tau) \cdot R_{F_i}(\tau, \psi) \xrightarrow{d} 0. \]

For the second statement, observe that since $\psi > 0$, the set of differences of $\psi$-dominated functions is dense in the positive cone of $L^1$. It follows easily that the linear span of the set of co-boundaries $\varphi = \tau - \tau^g$ with $\tau$ a $\psi$-dominated function is dense among all $L^1$-co-boundaries (note that in general a co-boundary splits into the difference of two positive co-boundaries). We now refer to the standard fact that, for ergodic actions and assuming $\int\psi \ne 0$, the linear span of $\psi$ and the $L^1$-co-boundaries is a dense subspace of $L^1(\mu)$ (see e.g. [1]). □

3.3 A density version of the maximal inequality 

The next step is to prove a maximal-type inequality that will allow us to go from the d-convergence of $R_{F_i}(\varphi,\psi)$ on a dense set of $\varphi \in L^1(\mu)$ to all of $L^1(\mu)$. Define the density-limsup by

\[ \operatorname*{d-limsup}_{n\to\infty} a_n = \inf\{t \in \mathbb{R} : d(n : a_n > t) = 0\}. \]

Lemma 3.5. If $a_n : X \to \mathbb{R}$ are measurable then $a = \operatorname{d-limsup} a_n$ is measurable.

Proof. It suffices to show that $\delta_t(x) = d(n : a_n(x) > t)$ is measurable for each fixed $t$, and this is obvious since $\delta_t(x) = \limsup_{N\to\infty} \frac{1}{N}\sum_{n=1}^{N} 1_{\{a_n > t\}}(x)$. □
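The difference between the ordinary limsup and the density-limsup can be seen on a simple sequence. Take $a_n = 1$ when $n$ is a perfect square and $a_n = 0$ otherwise (an example chosen here for illustration). Its ordinary limsup is 1, but for $0 \le t < 1$ the set $\{n : a_n > t\}$ is the set of squares, which has density zero, while for $t < 0$ it is all of $\mathbb{N}$; hence the density-limsup is 0. The sketch below computes only the finite averages whose limsup defines the density, assuming $d$ is the upper density; the names `density_fraction` and `square_indicator` are hypothetical.

```python
import math

def density_fraction(a, t, N):
    """Return (1/N) * #{1 <= n <= N : a(n) > t}, the N-th average in the density of {a_n > t}."""
    return sum(1 for n in range(1, N + 1) if a(n) > t) / N

def square_indicator(n):
    """Example sequence: a_n = 1 if n is a perfect square, else 0."""
    r = math.isqrt(n)
    return 1 if r * r == n else 0

if __name__ == "__main__":
    for N in (10 ** 2, 10 ** 4, 10 ** 6):
        # The fraction above threshold t = 0.5 shrinks like 1/sqrt(N); below t = 0 it stays 1.
        print(N, density_fraction(square_indicator, 0.5, N),
              density_fraction(square_indicator, -0.5, N))
```

A finite computation of this kind can only suggest the limiting behaviour; the density itself is defined through the limsup over all $N$.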

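Theorem 3.6. Let $\varphi,\psi \in L^1(\mu)$ with $\varphi,\psi \ge 0$ and $\int\psi\,d\mu \ne 0$, and write $d\nu = \psi\,d\mu$. Then

\[ \nu\left(\operatorname{d-limsup} R_{F_i}(\varphi,\psi) > t\right) \le \frac{C}{t}\int\varphi\,d\mu. \]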



Before giving the proof of the maximal inequality, let us use it to complete the proof of the ratio ergodic theorem. Fix $0 < \psi \in L^1(\mu)$; by Lemma 2.10 it suffices to prove $R_{F_n}(\varphi,\psi) \xrightarrow{d} \int\varphi / \int\psi$ for $\varphi \in L^1(\mu)$. Noting that $R_{F_n}(\varphi - c\psi, \psi) = R_{F_n}(\varphi,\psi) - c$ and setting $c = \int\varphi / \int\psi$, we may further assume that $\int\varphi\,d\mu = 0$. Let $\varepsilon > 0$ and let $\varphi' \in L^1(\mu)$ be such that $R_{F_n}(\varphi',\psi) \xrightarrow{d} 0$ and $\|\varphi - \varphi'\|_1 < \varepsilon$, as exists by the previous proposition. By Theorem 3.6,



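\[ \nu\left(\operatorname*{d-limsup}_{n\to\infty} R_{F_n}(|\varphi - \varphi'|, \psi) > \sqrt{\varepsilon}\right) \le \frac{C\|\varphi - \varphi'\|_1}{\sqrt{\varepsilon}} \le C\sqrt{\varepsilon} \]

and by the triangle inequality and $R_{F_n}(\varphi',\psi) \xrightarrow{d} 0$,

\[ \nu\left(\operatorname*{d-limsup}_{n\to\infty} |R_{F_n}(\varphi,\psi)| > \sqrt{\varepsilon}\right) \le \nu\left(\operatorname*{d-limsup}_{n\to\infty} |R_{F_n}(|\varphi - \varphi'|, \psi)| > \sqrt{\varepsilon}\right) \le C\sqrt{\varepsilon}. \]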

From this we conclude that

\[ \nu\left(\operatorname{d-limsup} |R_{F_n}(\varphi,\psi)| > 0\right) = 0. \]

Since $\psi > 0$ the measures $\nu$ and $\mu$ are equivalent, so this is the same as $R_{F_n}(\varphi,\psi) \xrightarrow{d} 0$ $\mu$-a.e., as desired.

Turning to the maximal inequality, we will use the following Besicovitch-type prop- 
erty: 

Lemma 3.7. There is a constant $C$ such that for any $k, N$, if $\{B_{r(i)} g_i\}_{i=1}^{k}$ is an incremental sequence such that $r(i) \in [N, 2N]$, then its multiplicity is at most $C$.

Proof. From the assumption, the sets $B_{[r(i)/2]} g_i$ are pairwise disjoint sets of size $\ge c_1 (N/2)^c$. If $h \in \bigcap_{i\in I} B_{r(i)} g_i$ for some $I \subseteq \{1,\dots,k\}$ then $B_{[r(i)/2]} g_i \subseteq B_{3N} h$ for $i \in I$, and the maximal number of such balls is therefore $|B_{3N} h| / |B_{[N/2]}|$. Since $|B_{3N}| \le c_2 (3N)^c$ we have $|I| \le 6^c c_2 / c_1$. □
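To see the size of this constant in a concrete case, take $G = \mathbb{Z}^2$ with the standard generators (an example chosen here for illustration, not taken from the text). There $|B_n| = 2n^2 + 2n + 1$, so the growth bounds hold for $n \ge 1$ with $c = 2$, $c_1 = 2$ and $c_2 = 5$, and the counting step of the proof reads as follows.

```latex
\[
  |I| \;\le\; \frac{|B_{3N}|}{|B_{[N/2]}|}
      \;\le\; \frac{c_2\,(3N)^c}{c_1\,(N/2)^c}
      \;=\; 6^c\,\frac{c_2}{c_1}
      \;=\; 6^2 \cdot \frac{5}{2}
      \;=\; 90 .
\]
```

The bound is crude, but the point is that it depends only on the growth constants and not on $k$ or $N$.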

Next we apply a variant of the Vitali covering argument. 

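Lemma 3.8. Let $\alpha > 0$, let $\varphi,\psi : X \to [0,\infty)$ and let $N$ be given and $E \subseteq F_N$. Suppose that for each $g \in E$ there is a $1 \le j(g) \le N$ such that $\varphi_{j(g)}(T^g x) < \varepsilon$ and $\psi_{j(g)}(T^g x) < \varepsilon$, where $\varphi_i,\psi_i$ are defined as in (6). Also suppose that $R_{F_{j(g)}}(\varphi,\psi,T^g x) > \alpha$. Then

\[ \sum_{g\in E} \psi(T^g x) \le \frac{C\alpha^{-1}}{1-\varepsilon}\, S_{F_N^2}(\varphi,x). \]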

Proof. Let $E_m = \{g \in E : j(g) \in J_m\}$. Let $M$ be the maximal value of $m$ for which $E_m \ne \emptyset$. Define $E'_m \subseteq E_m$ recursively starting from $m = M$ and working down to $m = 1$: assuming we have defined $E'_k$ for $k > m$, define $E'_m = \{g_{m,1},\dots,g_{m,\ell(m)}\}$ to be a maximal sequence satisfying the property in the hypothesis of the previous lemma with respect to $r(g) = n(j(g))$, and also satisfying $g \notin \bigcup_{k>m}\bigcup_{i=1}^{\ell(k)} B_{r(g_{k,i})} g_{k,i}$.
It is easily seen by induction that

\[ E_m \subseteq \bigcup_{k\ge m}\bigcup_{i=1}^{\ell(k)} F_{j(g_{k,i})} g_{k,i}. \]

For $h \in E'_m$ let

\[ F'_{j(h)} h = F_{j(h)} h \setminus \bigcup_{k<m}\bigcup_{i=1}^{\ell(k)} F_{j(g_{k,i})} g_{k,i} \]

so that $\bigcup_m \bigcup_{h\in E'_m} F'_{j(h)} h = \bigcup_m \bigcup_{h\in E'_m} F_{j(h)} h$. Given $g \in G$ and $m$, by the previous lemma $g$ belongs to at most $C$ of the sets $F_{j(g_{m,i})} g_{m,i}$; therefore, if $m_0 = m_0(g)$ is the least index such that $g \in F_{j(g_{m_0,i})} g_{m_0,i}$ for some $i$, then $g$ belongs to $F'_{j(h)} h$ only for some $h \in E'_{m_0}$ (but for no elements $h \in E'_{m'}$ with $m' \ne m_0$), and to at most $C$ such sets. It follows that

\[ \sum_{g\in E} \psi(T^g x) \le \sum_{m}\sum_{i=1}^{\ell(m)}\sum_{g\in F'_{j(g_{m,i})} g_{m,i}} \psi(T^g x). \]

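Now by our assumptions about $\varphi_i(T^g x)$ and $\psi_i(T^g x)$ for $g \in E$, and the fact that $F_{j(h)} h \setminus F'_{j(h)} h \subseteq \partial^* F_{j(h)} h$, we conclude that for all $m$ and $1 \le i \le \ell(m)$,

\begin{align*}
\sum_{g\in F'_{j(g_{m,i})} g_{m,i}} \psi(T^g x)
&\le \sum_{g\in F_{j(g_{m,i})} g_{m,i}} \psi(T^g x) \\
&< \alpha^{-1} \sum_{g\in F_{j(g_{m,i})} g_{m,i}} \varphi(T^g x) \\
&\le \frac{\alpha^{-1}}{1-\varepsilon} \sum_{g\in F'_{j(g_{m,i})} g_{m,i}} \varphi(T^g x).
\end{align*}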

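Combined with the previous inequality, this gives

\begin{align*}
\sum_{g\in E} \psi(T^g x)
&\le \frac{\alpha^{-1}}{1-\varepsilon} \sum_{m}\sum_{i=1}^{\ell(m)}\sum_{g\in F'_{j(g_{m,i})} g_{m,i}} \varphi(T^g x) \\
&\le \frac{C\alpha^{-1}}{1-\varepsilon} \sum_{g\in \bigcup_m \bigcup_i F_{j(g_{m,i})} g_{m,i}} \varphi(T^g x) \\
&\le \frac{C\alpha^{-1}}{1-\varepsilon}\, S_{F_N^2}(\varphi,x)
\end{align*}

because $\varphi \ge 0$ and each set $F_{j(g_{m,i})} g_{m,i}$ is contained in $F_N F_N = F_N^2$; the claim follows. □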
Proof of the maximal inequality (Theorem 3.6). Since it suffices to prove the claim with $\varphi$ replaced by $\varphi + \rho\psi$ for arbitrarily small $\rho$, we can assume that $\varphi > 0$. Write $R = \operatorname{d-limsup} R_{F_i}(\varphi,\psi)$. Fix $t > 0$ and denote

\[ S = \{x : R(x) > t\}. \]

For $\delta > 0$ let

\[ S_\delta = \{x : d(i : R_{F_i}(\varphi,\psi,x) > t + \delta) > \delta\}. \]

Since $S = \bigcup_{\delta>0} S_\delta$ and the union is monotone, it suffices for us to show that $\nu(S_\delta) \le C\int\varphi / t$ for a constant $C$ independent of $\delta$. By Theorem 3.2, for $\varphi\,d\mu$-a.e. $x \in S_\delta$ we have $\varphi_i(x) \xrightarrow{d} 0$, with $\varphi_i$ as in (6). Since $\varphi,\psi > 0$ the measures $\varphi\,d\mu$ and $\nu = \psi\,d\mu$ are equivalent, so this is also true $\nu$-a.e., hence the set

\[ S'_\delta = \{x : d(i : R_{F_i}(\varphi,\psi,x) > t + \delta \text{ and } \varphi_i(x) < \delta) > \delta\} \]

differs from $S_\delta$ on a set of $\nu$-measure 0, and it suffices to bound $\nu(S'_\delta)$. Now, since $\nu$ is a finite measure, there is an $N$ such that

\[ S'_{\delta,N} = \{x : R_{F_i}(\varphi,\psi,x) > t + \delta \text{ and } \varphi_i(x) < \delta \text{ for some } 1 \le i \le N\} \]

satisfies

\[ \nu(S'_{\delta,N}) > \tfrac{1}{2}\,\nu(S'_\delta) \]

and so it suffices to bound the measure of $S'_{\delta,N}$.

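This now is a direct application of the transference principle and the previous lemma. We have

\[ |F_N|\,\nu(S'_{\delta,N}) = \int S_{F_N}(\psi\,1_{S'_{\delta,N}})\,d\mu. \]

For $x \in X$ let

\[ E = E_x = \{g \in F_N : T^g x \in S'_{\delta,N}\} \]

and for $g \in E$ define $i(g) = i_x(g) \in \{1,\dots,N\}$ to be an index such that $R_{F_{i(g)}}(\varphi,\psi,T^g x) > t$ and $\varphi_{i(g)}(T^g x) < \delta$. In this notation,

\[ S_{F_N}(\psi\,1_{S'_{\delta,N}})(x) = \sum_{g\in E_x} \psi(T^g x) \]

and we may apply Lemma 3.8 at each $x$, concluding that

\[ |F_N|\,\nu(S'_{\delta,N}) \le \frac{C}{(1-\delta)\,t} \int S_{F_N^2}(\varphi)\,d\mu = \frac{C}{(1-\delta)\,t}\,|F_N^2| \int \varphi\,d\mu. \]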
The conclusion now follows from the fact that by polynomial growth, $|F_N^2| / |F_N| = |B_{2n(N)}| / |B_{n(N)}|$ is bounded uniformly in $N$. □